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Abstract — Algebraic decoding algorithms are commonly ap- 
plied for the decoding of Reed-Solomon codes. Their main 
advantages are low computational complexity and predictable 
decoding capabilities. Many algorithms can be extended for 
correction of both errors and erasures. This enables the decoder 
to exploit binary quantized reliability information obtained from 
the transmission channel: Received symbols with high reliability 
are forwarded to the decoding algorithm while symbols with 
low reliability are erased. In this paper we investigate adaptive 
single-trial error/erasure decoding of Reed-Solomon codes, i.e. 
we derive an adaptive erasing strategy which minimizes the 
residual codeword error probability after decoding. Our result is 
applicable to any error/erasure decoding algorithm as long as its 
decoding capabilities can be expressed by a decoder capability 
function. Examples are Bounded Minimum Distance decoding 
with the Berlekamp-Massey- or the Sugiyama algorithms and 
the Guruswami-Sudan list decoder. 

I. Introduction 

Using algebraic error/erasure decoders for pseudo-soft de- 
coding of Reed-Solomon (RS) codes dates back to Forney 
ID, Q. His Generalized Minimum Distance (GMD) decoding 
applies an error/erasure decoder multiple times, each time 
with an increased number of erased most unreliable symbols 
of the received word. In very good channels, the residual 
codeword error probability of GMD decoding approaches that 
of Maximum Likelihood (ML) decoding if the number z of 
such decoding trials is sufficiently large, i.e. z ps d m ; n /2, 
where d m - m is the minimum Hamming distance of the RS 
code. Thus, the computational complexity of GMD decoding 
is d m in/2-times that of errors-only decoding. Roughly, we 
can say d m \ n £ 0(n), which means that quadratic decoding 
complexity in the code length n becomes cubic and so on. 
Kotter [3] provided a modification of the Berlekamp-Massey 
algorithm that essentially computes all decoding trials at 
once, i.e. without increasing complexity. A similar result was 
achieved in j4), [0 for the Euclidean algorithm and in J6] 
using Newton interpolation. Up to our knowledge, there is no 
such modification of Guruswami-Sudan (GS) [7 | list decoding 
- which can be considered as state of the art in algebraic 
decoding - so far. However, Kotter and Vardy provided a 
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modification of the GS algorithm that is capable of exploiting 
soft information [ 8 1 . 

GMD decoding is per se a fixed approach, i.e. the erasing 
strategy is constant for each received word. Adaptive variants 
have been investigated in S-JTT]. The respective authors 
show that the number z of decoding trials can be reduced 
significantly, if the erasing strategy is optimally calculated for 
every single received word. The aforementioned papers focus 
on the maximization of the achievable decoding radius, i.e. the 
maximum correctable number of errors in the received word. 
In contrast to that, our objective is to minimize the residual 
codeword error probability after decoding. We achieve this 
using a technique first introduced in 2010 lfl2l for optimal 
error/erasure decoding of binary codes. As in the latter paper, 
we restrict ourselves to one single decoding trial, i.e. z = 1, 
for simplicity. 

The rest of the paper is organized as follows. In Section [II] 
we give an overview of error/erasure decoding and introduce 
the required notations. We introduce the decoder capability 
function which allows to derive the optimal erasing strategy 
in a general manner. Here and in the following, optimal 
means minimizing the residual codeword error probability. In 
Section [HI] we derive an optimal adaptive erasing strategy for 
one single decoding trial. Section HVl describes two computa- 
tionally efficient approximations of the optimal strategy, one 
of them with complexity quadratic in the code length n. In 
Section [V] we show the potential of single-trial error/erasure 
decoding in terms of achievable residual codeword error prob- 
ability as well as the quality of the two approximative variants 
by simulation. The paper is wrapped up with conclusions and 
an outlook in Section |VI] 

II. Reliability-Based Error/Erasure Decoding 
and Decoder Capability Functions 

Consider the RS code 7ZS(q; n, k, d m i n ) of length n, dimen- 
sion k and minimum distance <i m i n over the extension field ¥ q . 
Thereby, q := p' m for some prime number p and some integer 
m. The transmitter encodes an information vector a€Fj into 
a codeword c := (cq, . . . , c n -i) € 1ZS C F™. Each symbol Cj, 
i = 0, . . . , n — 1, is mapped to a symbol xi of a modulation 
alphabet A C R 2 resulting in the modulated codeword x := 
(xoi • ■ • , %n-x) € -4™- This modulated codeword is transmitted 
over the channel, where it is distorted by two-dimensional 



Additive White Gaussian Noise (AWGN). At the receiver, the 
received word x + e = (yo, • • • ,Vn-i) = : y € (R 2 ) is 
obtained. It is the sum of the modulated codeword and an 
error word (eo, . . . , e n -i) ='■ e G (K 2 ) . Each symbol yj 
is mapped to the closest (in Euclidean metric) modulation 
point of A, the result of this procedure is the hard decision 
x := (xq, . . . , x n -i) G A n of y. This hard decision can be fed 
into the inverse mapper function to obtain a received vector 
r := (ro, . . . , r n -i) G F™, which in turn can be fed into any 
algebraic hard decision decoder for US. 

The hard decision, i.e. the mapping from received symbols 
Ui to closest modulation points Xi, is error-prone since it is 
not necessarily correct: The received j/j = Xi + might be 
closest in Euclidean metric to an x\ 7^ x^. We refer to the 
probability of an incorrect hard decision as unreliability of a 
received symbol and denote it by h a (xi). Here, 
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is the AWGN standard deviation for E^/Nq given in dB. 
Note that due to the one-to-one mapping between X{ and 
r-j we can as well write the unreliability as a function of the 
de-modulated received symbols, i.e. h a (ri) := h a (xi). By 
definition, we have 

h a {ri) = 1 — Pr(5i transmitted | y received) 
Pr(y I Xi) Pr(ii) 



= 1 
= 1 
= 1 



Pr(y) 
Pvjy I Xi)PT(xj) 
E, e ^Pr(y \x) Pr(x) 
Pr(y I Xi) 



(i) 



where the last equality follows from the assumption of 
equiprobable codeword symbols. In practice, the calculation 
of (Q]) is not feasible for large modulation alphabets A, hence 
we use the nearest neighbor approximation 



h a (n) 
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Pr(y I Xi) 
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(2) 



where B(xi) C A is the set of nearest neighbors of X{. This 
allows to store all possible values of h a (ri) in a comparatively 
small lookup table as the following example demonstrates. 

Assume 256-Quadrature Amplitude Modulation ( QAM) 
with average signal energy one, Gray mapping and 8— bit- 
quantization. Without nearest neighbor approximation, this 
leads to a lookup table with 65536/4 = 16348 entries, 
each containing two integers for the coordinates and one 
real number for the unreliability value h a (ri). Considering 
only the nearest neighbors, the lookup table consists of only 
64 + 128 + 128 = 320 such entries if symmetries within 
the QAM decision regions are exploited. This allows to store 
lookup tables for many different E^/No values, e.g. up to 
the precision of the (required) channel estimation. Fig. Q] 
shows a density plot of h a (ri) for the complete Euclidean 



plane, the black dots mark the modulation points, darker color 
marks higher unreliability. Note that the unreliabilities for most 
decision regions are either rotated versions of each other or 
they coincide. Furthermore, the unreliabilities of the decision 
regions are either point symmetric to their modulation point 
(regions in the center) or symmetric to a line through the 
modulation point (border and corner regions). This allows to 
discard 3/4 of the quantization intervals in the first case and 
1/2 of the intervals in the second case. 
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Fig. 1. Unreliability h a (ri) for the Euclidean plane, 256-QAM, 8— bit- 
quantization, AWGN@18 dB. 

At the receiver, the unreliability is calculated (or taken 
from the lookup table) for every received symbol r^, i = 
0, . . .,n — 1. W.l.o.g. we assume here and in the following 
that the received word r is ordered according to its symbol's 
unreliabilities, i.e. h a (ro) > ■ ■ ■ > h a (r n -i). The idea of 
error/erasure decoding is to discard the r most unreliable 
symbols (i.e. to erase them) since it is likely that they are 
erroneous. Instead of r, the input vector fed into the algebraic 
decoder is then 



r r = (X, 



l). 



where the first r symbols are replaced by the erasure marker X. 
In order to do this, two conditions need to be fulfilled. First, we 
require an algebraic decoder which is capable of decoding both 
errors and erasures. Second, we must be capable of deciding 
how many of the most unreliable symbols should be discarded. 

Algebraic error/erasure decoders for RS codes are well- 
known. Classical Bounded Minimum Distance (BMD) e.g. 
decoding using the Berlekamp-Massey- or the Sugiyama 
algorithms can be augmented by an erasure option |[T3l as 
can the GS list decoder Q. In |[T4l . a decoder with erasure 
option for Interleaved Reed-Solomon (IRS) codes from lfT5l 
is applied to decode l-punctured RS codes. 

The decoder capability function (DCF) of an algebraic 
enor/erasure decoder is an inequality of the form 

f(n,£,r) >k-l, 

which is true whenever the decoder can correct e errors and r 
erasures in any given received word. Three important examples 



Decoder 



Bounded Minimum Distance 



f(n,e,r) 



■ 2e 



£o(t) 



n — k + 1 — - 



IRS-based (for ^-punctured RS codes) 



i(n - k + 1 - r) 



- 1 



Guruswami-Sudan, ^ — > oo 



(n-r- £ ) 2 



[n - r - ^(n-r)(fc-l)~| - 1 



TABLE I 

Three algebraic decoders for7?.S(<3i; n, fc,d min ), their respective /(n, e, t)-functions, and eq(t). 



for error/erasure decoders and their respective /(n,e,r) are 
given in Table U 

Based on the DCF of a decoder, it is straightforward to 
calculate the maximal number of correctable errors for any 
given number of erasures in a received word, i.e. the maximal 
number e which fulfills the DCF for fixed r. We denote this 
number of errors by £o(t), where r, < r < d m i n — 1, 
is the number of erasures. Table [i] shows £o(t) for the three 
considered decoders. 

III. Optimal Erasing Strategy 

In this section, we shall solve the following basic problem 
of adaptive single-trial error/erasure decoding. Thereby, our 
technique is similar to |[T2l . 

Problem 1 For given received vector r (ro, . . . , 
with ordered unreliability values h a (rtf) > ••• > /i CT (r„_i) 
and channel state a, find the optimal number t*, < t* < 
d m - ln — 1, of erased most unreliable symbols, such that the 
residual codeword error probability of decoding r T * using a 
decoder with DCF /(n, e, r) is minimized. 

The foundation of our solution is basic probability theory 
and our aim is to express the residual codeword error proba- 
bility as a function of the number r of erased most unreliable 
symbols. In the following, we omit all subindices a for simpler 
reading. However, the reader should keep in mind that all 
functions and values depend on a. 

Given a received vector r, we define the binary random 
variables Xi, i = 0, . . . , n — 1, by 



r r . We denote their number by the discrete random variable 



X,. := 



1, if ri is erroneous (r.; ^ Cj) 
0, if ri is correct (ri — Ci) 



By definition of h(r{), the probabilities of the two possible 
outcomes of Xi are Pr(Xi = 1) = h(ri) and Pr(X; = 0) = 
1 — h(ri). Thus, the probability generating function (PGF) of 
Xi is 

GxAp) •=l-h{r i ) + ph{T i ). 

After erasing r symbols from r, there are e, < e < n — r, 
erroneous symbols within the non-erased n — r symbols of 



n-l 



Y T := Xi 



with PGF 



n-l 



GyAp) := \{G x% {p). 



Using the PGF of Y T , the probability for e errors in the 
Ti — t non-erased symbols can be calculated by 



Pr(Y r = e) 



p=0 



where the superscript ^ denotes the e-th derivative. 

Decoding r r is successful if e < £o( T ) an d fails otherwise. 
Hence, we can state the residual codeword error probability 
after decoding r r by 

n £o(t) 

P(t) := J2 Pr ^ = £ ) = 1 - E Pr ^ = £ )- W 

e=e { T ) + l e=0 

Consequently, the optimal choice of r (solving Problem [1} is 



t* := arg min {P{t)} 

0<T<tI m i n — 1 



arg 



£ (t) 

max < >^ Pr(Y T = e) 

0<T<d min -l ^— ' 



(4) 



E=0 



A closer look on the complexity of shows that it is 
in C(n 3 ), see [12| for a detailed explanation. The main task 
is to calculate Pr(Y T = 0),...,Pr(Y T = £o(r)) for all r, 
< t < c? m j n — 1 in order to obtain P(t). 

IV. Computationally Efficient Erasing Strategies 

In this section, we give two approximations of P(t) which 
eventually allow to calculate approximative r* with complex- 
ities in 0(n 2 y/n) and C(n 2 ), respectively. 



A. Approximation based on the Hoeffding Bound 

Given a set of independent random variables fulfilling 
certain properties, the Hoeffding bound lfl6ll states that the 
probability of this sum to assume values outside a small 
discrete interval I around its expectation is exponentially 
small. We show in [ 12 1 that the Xi, i = 0, . . . , n— 1, and their 
partial sum Y T have these properties, leading to the inequality 

Pr(|F T - E{Y T }\ >s)< 2exp (-^-\ , 

where s is a parameter that denotes the one-directional size 
t = s/ (n — t) of 

1 := [E{Y T } -t,..., E{Y T }, E{Y T } + t}. 

Setting s > \J — log(0.5 • 10~ 2 )2rt guarantees that the share 
of the probabilities Pr(F T = e) for e ^ I in the calculation 
© of P(r) is small, cf. Table HD 



Partial Sum 


Share in P(-r) 


E £e xPr(Y T = E ) 


> 99% 




< 1% 



TABLE II 

Percentual shares of the probabilities Pr(y r = e) IN P(t) for 
e e I and e X. 

As a result, all values e ^ 1 in Q can be neglected and a 
good approximation of the residual codeword error probability 
is obtained by 

min{B{Y T }+t,e (T)} 

P(t) « 1 - £ Pr ^ = £ )' 

E=roax{S{'K r }-t,0} 

In fl2l . the complexity of adaptive single-trial error/erasure 
decoding of binary codes with the Hoeffding approximation 
of P(t) is stated to be in 0(n 2 ^fn), this also holds for our 
case of (non-binary) RS codes. 

B. Approximation based on £q(t) 

For the second approximation, we require the following 
proposition. So far it is verified only by experiments and we 
are working on a proof. 

Proposition 1 For fixed r, < r < d min — 1, Pt(Y t = e) is 
a unimodal function in e and its mode is determined by the 
expectation E{Y T }. 

Fig. |2] shows a 3D plot of Pt(Y t = 0), . . . ,Pr(Y T = £ (t)) 
for all r, < t < d m i n — 1, each slice in e-direction 
is unimodal according to Proposition Q] and each E{Y T } 
coincides with the mode of the respective r. 

If the expectation is smaller than the error boundary, i.e. 
if E{Y T } < £o(t), then we can approximate the sum 
X]" =eo ( T ) + i P r (^r = e) by its largest element, which is 
Pr(y T ° = £ (t) + 1). Analogously, if E{Y T } > e (r), 




Fig. 2. Probabilities Pr(Y T = e) for 7£S(256; 255, 144, 112), AWGN@18 
dB, £o( T ) of the GS list decoder. 



then Pr(Y T = £q(t)) is a good approximation of the sum 
EeIo Pr (^r = e). Inserting into © gives 

J 1- Pr(F T = eo (r)), if S{y T } > e (r) 
~ \ Pr(Y T = e (r) + 1), if S{y r } < e (r) ' 

Based on the complexity analysis of the Hoeffding approx- 
imation in lfl2l . it is easy to see that the complexity of the 
£o(t) approximation is in 0(n 2 ). Since there are no practical 
decoders with lower complexity than 0(n 2 ), adaptive single- 
trial error/erasure decoding is in the same complexity class 
as the error/erasure decoder itself and the computation of r* 
increases complexity only by an additive constant. 

V. Simulation Results 

We investigate the potential of adaptive single-trial er- 
ror/erasure decoding for the RS code ^5(256; 255, 144, 112). 
Two error/erasure decoders are considered: The classical BMD 
decoder (e.g. Berlekamp-Massey or Sugiyama) and the GS list 
decoder with multiplicities v — > oo. 

Performance evaluation is done by simulation and by semi- 
simulative upper bounds of the residual codeword error prob- 
ability. For each considered Eb/No, we calculate an average 
unreliability vector h := (ho, . . . , /j n -i)> ho > ■ ■ ■ > h n -i, 
by averaging over 10 4 random unreliability vectors. For each 
variant of P(t) (exact, Hoeffding approximation, £o(t) ap- 
proximation), we calculate f := r* for h according to 
We use f for every received vector, which means that the 
simulation is in fact non-adaptive, using the optimal erasing 
strategy for the average unreliability vector. The resulting 
residual codeword error probability curves are indeed upper 
bounds, it is clear that the error probability can not be higher 
when t* is calculated for every single received vector. Precise 
error probabilities of errors-only decoding are obtained by 
inserting r = into (f3]). 

The £o( r ) approximation is considered in Fig. [3] It shows 
actual simulation results for Eb/No = 15, . . . , 16.5 dB and 
the aforementioned upper bound for Eb/No = 16.5, 17.75 
dB. Clearly, adaptive single-trial error/erasure decoding with 
a classical BMD decoder yields a gain of approximately 0.2 
dB for practical error probabilities. For the GS list decoder, 
the achievable gain is negligible. The reason for this lies 
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Fig. 3. Residual codeword error probability vs. Eb/NO for different de- 
coders, 7^.5(256; 255, 144, 112). Adaptive single-trial error/erasure decoding 
is based on the so(t) approximation. 

in the non-linearity of the GS list decoder's £o( T ) function, 
whose slope gets steeper with decreasing r. This means that 
the benefit of transforming errors into erasures diminishes for 
a small number of erased symbols. The residual codeword 
probability curve of Forney's original z ps d m i n /2— trial GMD 
decoding Q, (2) is given as a reference. Fig. [3] shows that 
most of GMD's gain can be achieved by a single adaptive 
trial, if only the erasing strategy is chosen optimally. 

Fig. |4] shows for an interesting range of residual codeword 
error probabilities that there is virtually no difference between 
exact calculation of P(t) and the two proposed approxima- 
tions. Our recommendation is to use the £q{t) approximation 
whenever exact calculation of P(t) is prohibitive. It is feasi- 
ble both in terms of computational complexity (0(n 2 )) and 
approximation quality. 
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Fig. 4. Comparison between the exact optimal erasing strategy and the two 
proposed approximations, 7£iS(256; 255, 144, 112). 

VI. Conclusions 

Classical error/erasure BMD decoders for RS codes are 
widely deployed. We presented an adaptive single-trial er- 
ror/erasure decoding technique, which allows to decrease 



the residual codeword error probability of such decoders 
using a low-complexity, i.e. (D(n 2 ), pre-computation step. 
The achievable gain of our technique is around 0.2 dB for 
KS{256; 255, 144, 112). This is slightly less than the gain of 
GMD decoding but neither does it require a modification of 
the decoder itself (Kotter's fast GMD decoder nor does 
it require z s» d m - m /2 decoding trials (Forney's original GMD 
decoder (T), G))- Our technique is general, it can be applied 
to any error/erasure decoder as long as its DCF is known. 
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